Differentiable channel pruning guided via attention mechanism: a novel neural network pruning approach

نویسندگان

چکیده

Abstract Neural network pruning offers great prospects for facilitating the deployment of deep neural networks on computational resource limited devices. architecture search (NAS) provides an efficient way to automatically seek appropriate design compressed model. It is observed that, existing NAS-based methods, there usually a lack layer information when searching optimal architecture. In this paper, we propose new NAS approach, namely, differentiable channel method guided via attention mechanism (DCP-A), where adopted able provide guide optimization policy. The training process with Gumbel-softmax sampling, while parameters are optimized under two-stage procedure. block shortcut dedicatedly designed, which help prune not only its width but also depth. Extensive experiments performed verify applicability and superiority proposed method. Detailed analysis visualization pruned model shows that our DCP-A learns explainable policies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neural Network Pruning and Pruning Parameters

The default multilayer neural network topology is a fully interlayer connected one. This simplistic choice facilitates the design but it limits the performance of the resulting neural networks. The best-known methods for obtaining partially connected neural networks are the so called pruning methods which are used for optimizing both the size and the generalization capabilities of neural networ...

متن کامل

Breast Cancer Diagnostic Factors Elimination via Evolutionary Neural Network Pruning

The redundant medical diagnostic factors elimination problem was faced by the use of Artificial Neural Networks that were evolved using Genetic Algorithms. For specific medical diagnosis problems such as breast cancer classification, with given set of diagnostic input parameters, Genetic Algorithms were used for pruning Neural Network structure and the investigation of the most appropriate subs...

متن کامل

Structured Deep Neural Network Pruning via Matrix Pivoting

Deep Neural Networks (DNNs) are the key to the state-of-the-art machine vision, sensor fusion and audio/video signal processing. Unfortunately, their computation complexity and tight resource constraints on the Edge make them hard to leverage on mobile, embedded and IoT devices. Due to great diversity of Edge devices, DNN designers have to take into account the hardware platform and application...

متن کامل

An Evolutionary Bootstrap Approach to Neural Network Pruning and Generalization

This paper combines techniques drawn from the literature on evolutionary optimization algorithms along with bootstrap based statistical tests. Bootstrapping is used as a general framework for estimating objectives out of sample by redrawing subsets from a training sample. Evolution is used to search the large number of potential network architectures. The combination of these two methods create...

متن کامل

Adaptive Predictive Controllers Using a Growing and Pruning RBF Neural Network

An adaptive version of growing and pruning RBF neural network has been used to predict the system output and implement Linear Model-Based Predictive Controller (LMPC) and Non-linear Model-based Predictive Controller (NMPC) strategies. A radial-basis neural network with growing and pruning capabilities is introduced to carry out on-line model identification.An Unscented Kal...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Complex & Intelligent Systems

سال: 2023

ISSN: ['2198-6053', '2199-4536']

DOI: https://doi.org/10.1007/s40747-023-01022-6